# Common Voice Dataset
Whisper Kurmanji
Apache-2.0
An automatic speech recognition model for the Kurdish Kurmanji dialect, fine-tuned based on the Whisper architecture.
Speech Recognition
Safetensors Other
W
amedcj
272
1
Vlzcrz Whisper Small Japanese 2
Apache-2.0
A Japanese speech recognition model fine-tuned on the Common Voice 17.0 dataset based on openai/whisper-small
Speech Recognition
Transformers Japanese

V
vlzcrz
28
1
Whisper Large V3 Japanese 4k Steps
Apache-2.0
A speech recognition model fine-tuned on the Common Voice 16.1 Japanese dataset based on openai/whisper-large-v3, trained for 4000 steps
Speech Recognition
Transformers Japanese

W
drewschaub
94
4
Tts Thai Last Step
MIT
This is a Thai text-to-speech model based on the Tacotron2 architecture, trained using a modified Common Voice Thai dataset, with processed speech that does not retain original speaker characteristics.
Speech Synthesis Other
T
lunarlist
42
2
Tts Thai
MIT
A Thai text-to-speech model based on the Tacotron2 architecture, trained using a modified Common Voice Thai dataset
Speech Synthesis Other
T
lunarlist
54
1
Exp W2v2t Zh Cn Wavlm S596
Apache-2.0
A Chinese speech recognition model fine-tuned based on microsoft/wavlm-large, supporting Simplified Chinese, trained using the Common Voice 7.0 (zh-CN) dataset.
Speech Recognition
Transformers

E
jonatasgrosman
22
1
Exp W2v2t Ja Xlsr 53 S109
Apache-2.0
Japanese automatic speech recognition model fine-tuned based on facebook/wav2vec2-large-xlsr-53, trained using Common Voice 7.0 Japanese dataset
Speech Recognition
Transformers Japanese

E
jonatasgrosman
20
0
Wav2vec2 Large Xls R 300m Hindi Epochs15 Colab
Apache-2.0
This is a speech recognition model fine-tuned on the Common Voice dataset based on the facebook/wav2vec2-xls-r-300m model, supporting Hindi.
Speech Recognition
Transformers

W
vai6hav
17
0
Wav2vec2 Common Voice Tr Demo Dist
Apache-2.0
This model is an automatic speech recognition (ASR) model fine-tuned on the COMMON_VOICE - TR Turkish dataset based on facebook/wav2vec2-large-xlsr-53, achieving a word error rate of 0.3242 on the evaluation set.
Speech Recognition
Transformers Other

W
cromz22
26
0
Wav2vec2 Large Xls R 300m Turkish Colab Common Voice 8 4
Apache-2.0
This model is a speech recognition model fine-tuned on the Common Voice Turkish dataset, based on Facebook's wav2vec2-xls-r-300m model.
Speech Recognition
Transformers

W
husnu
19
0
Wav2vec2 Large Xls R 300m German With Lm
Apache-2.0
A speech recognition model fine-tuned on the Common Voice German dataset based on facebook/wav2vec2-xls-r-300m, integrated with an n-gram language model, achieving a word error rate of 8.8%
Speech Recognition
Transformers

W
mfleck
26
1
Wav2vec2 Base Cv 10000
Apache-2.0
A speech recognition model fine-tuned on the Common Voice dataset based on wav2vec2-base-cv, achieving a word error rate of 36.84% on the evaluation set.
Speech Recognition
Transformers

W
jiobiala24
28
0
Wav2vec2 Base Checkpoint 12
Apache-2.0
This model is a fine-tuned version based on wav2vec2-base-checkpoint-11.1 on the Common Voice dataset, primarily used for speech recognition tasks.
Speech Recognition
Transformers

W
jiobiala24
16
0
Wav2vec2 Xlsr Romansh Sursilvan
Apache-2.0
This model is an automatic speech recognition model fine-tuned on the Romansh-Sursilvan dialect dataset based on facebook/wav2vec2-xls-r-1b, achieving a word error rate (WER) of 13.82% on the Common Voice 8 test set.
Speech Recognition
Transformers

W
sammy786
18
0
Wav2vec2 Large Xls R 300m Hausa
Apache-2.0
This is an automatic speech recognition model fine-tuned on Hausa speech data based on facebook/wav2vec2-xls-r-300m
Speech Recognition
Transformers Other

W
infinitejoy
22
1
Wav2vec2 Xls R 300m Zh TW
Apache-2.0
This is a Chinese-Taiwan speech recognition model fine-tuned on the COMMON_VOICE - ZH-TW dataset based on the facebook/wav2vec2-xls-r-300m model
Speech Recognition
Transformers

W
StevenLimcorn
58
1
Wav2vec2 Base Checkpoint 14
Apache-2.0
A speech recognition model based on the wav2vec2 architecture, fine-tuned on the Common Voice dataset
Speech Recognition
Transformers

W
jiobiala24
16
0
Xls R Eng
Apache-2.0
This is a small random robustness model based on the wav2vec2 architecture, fine-tuned on the MOZILLA-FOUNDATION/COMMON_VOICE_7_0 - AB dataset for automatic speech recognition tasks.
Speech Recognition
Transformers Other

X
mattchurgin
13
0
Tts Transformer Ar Cv7
A Transformer-based text-to-speech model using fairseq S^2, supporting Arabic single male speaker synthesis
Speech Synthesis Arabic
T
facebook
53
8
Wav2vec2 Xlsr Multilingual 56
Apache-2.0
This is a multilingual automatic speech recognition (ASR) model supporting 56 languages, fine-tuned from facebook/wav2vec2-large-xlsr-53 on the Common Voice dataset.
Speech Recognition
Transformers Supports Multiple Languages

W
voidful
21.69k
30
Wav2vec2 Base Turkish Cv7
Apache-2.0
Turkish automatic speech recognition model based on wav2vec2 architecture, fine-tuned on the Common Voice 7.0 Turkish dataset
Speech Recognition
Transformers Other

W
cahya
21
0
Wav2vec2 Large Xlsr 53 Dutch
Apache-2.0
An automatic speech recognition model fine-tuned on the Dutch Common Voice dataset based on facebook/wav2vec2-large-xlsr-53, achieving a test WER of 17.09%.
Speech Recognition
Transformers Other

W
wietsedv
44
1
Wav2vec2 Large Xlsr Persian V2
Apache-2.0
An automatic speech recognition model fine-tuned on Persian (Farsi) using the Common Voice dataset, based on facebook/wav2vec2-large-xlsr-53
Speech Recognition Other
W
m3hrdadfi
47
6
Wav2vec2 Hausa2 Demo Colab
Apache-2.0
This model is a Hausa speech recognition model fine-tuned on the Common Voice dataset based on facebook/wav2vec2-large-xlsr-53
Speech Recognition
Transformers

W
Arnold
19
1
Wav2vec2 Xls R 300m Arabic
Apache-2.0
This is an automatic speech recognition (ASR) model fine-tuned on the Arabic Common Voice 7 dataset based on the facebook/wav2vec2-xls-r-300m model.
Speech Recognition
Transformers Arabic

W
AndrewMcDowell
148
0
Featured Recommended AI Models